Transcription Cost Reduction for Constructing Acoustic Models Using Acoustic Likelihood Selection Criteria

نویسندگان

  • Tomoyuki Kato
  • Tomiki Toda
  • Hiroshi Saruwatari
  • Kiyohiro Shikano
چکیده

This paper describes a novel method for reducing the transcription effort in the construction of task-adapted acoustic models for a practical automatic speech recognition (ASR) system. We have to prepare actual data samples collected in the practical system and transcribe them for training the task-adapted acoustic models. However, transcribing utterances is a time-consuming and laborious process. In the proposed method, we firstly adapt initial models to acoustic environment of the system using a small number of collected data samples with transcriptions. And then, we automatically select informative training data samples to be transcribed from a large-sized speech corpus based on acoustic likelihoods of the models. We perform several experimental evaluations in the framework of ‘Takemarukun’, a practical speech-oriented guidance system. Experimental results show that 1) utterance sets with low likelihoods cause better task-adapted models compared with those with high likelihoods although the set with the lowest likelihoods causes the performance degradation because of including outliers, and 2) MLLR adaptation is effective for training the task-adapted models when the amount of the transcribed data is small and EM training outperforms MLLR if we transcribe more than around 10,000 utterances.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Selective training of HMMs by using two-stage clustering

This paper proposes a method of constructing acoustic models from training data clustered in two stages. In the first stage, training data from a target task are clustered and generate GMMs for each cluster. The second stage uses the GMMs to select training data from a large-scale database based on the GMM likelihood. MAP estimation adapts an acoustic model for each cluster using the selected t...

متن کامل

Efficiency Assessment of Acoustic Cabin for Providing Acoustic Comfort in Turbine Unit of a Thermal Power Plant

Background and Objective: A practical method for noise control in environments with different noise sources is designing an acoustic cabin for the workers. In this regard, this study aimed to assess the efficiency of the acoustic cabin in a typical turbine unit of a thermal power plant to provide acoustic comfort. Materials and Methods: Measurement of the noise level and spectrum, as well as v...

متن کامل

Investigation on acoustic behavior of acoustic porous absorbers to ‎absorb sound energy and transmission loss index

In this study, the acoustic properties of porous absorbents with different porosity levels have been evaluated using different mathematical models. These models use one or more parameters of materials for calculating acoustic characteristics. In all of these models, materials are considered as equivalent fluid and reactionary characteristics have not been taken into account.

متن کامل

Effect of porosity on the characteristics of underwater acoustic sound absorbers using theoretical models‎

Porous materials have good acoustic damping characteristics over a wide frequency range. As for sound waves, many small-scale pores in the coating materials can convert underwater-coating to rough surfaces. The main property of porous absorbents is their resistance against incident sound wave that leads to damping effect. From a physical point of view, damping occurs due to friction between flu...

متن کامل

Utterance Verification Using State-Level Log-Likelihood Ratio with Frame and State Selection

This paper suggests utterance verification system using state-level log-likelihood ratio with frame and state selection. We use hidden Markov models for speech recognition and utterance verification as acoustic models and anti-phone models. The hidden Markov models have three states and each state represents different characteristics of a phone. Thus we propose an algorithm to compute state-lev...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006